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Abstract. I discuss several corrections to leading twist calculations of nucleon structure functions 
which are needed to include experimental data at large parton fractional momentum x and at low 
scales in global fits of parton distribution functions. In particular 1 discuss the results of the 
CTEQ6X global fit, and some work in progress. Topics covered include the interplay of target mass 
and higher-twist corrections, the importance of nuclear corrections for deuterium target data, and 
applications to the study of quark-hadron duality. Implications for collider physics are highlighted. 
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Precise parton distribution functions (PDFs) at large parton fractional momentum x 
are vital for understanding the non perturbative structure of the nucleon and the effects 
of color confinement on its partonic constituents. For instance, the J/m quark distribution 
ratio near x = lis very sensitive to the nature of the quark-quark forces in the nucleon; 
the ratios of spin-polarized to spin-averaged PDFs Au/u, and particularly Ad/d, in the 
limit X — )■ 1 reflect the non perturbative quark-gluon dynamics in the nucleon, and can 
shed light on the origin of the nucleon's spin. Precise PDFs at large x also have impact in 
other areas of nuclear and high-energy physics, e.g., by allowing precise computations 
of QCD background processes in searches of new physics signals at hadron colliders, 
and systematic uncertainties in neutrino oscillation experiments. 

PDFs can be extracted from experimental data through global QCD fits which com- 
bine data from many different processes and observables, and analyze them by means of 
perturbative QCD calculations [1, 2, 3, 4]. Currently, however, the unpolarized PDFs are 
well determined only for x < 0.5 for valence quarks, x < 0.3 for gluons, and jc < 0. 1 for 
heavy quarks. To better constrain these at large x it is necessary to study hard scattering 
processes near kinematic thresholds, such as Deep Inelastic Scattering (DIS) at large 
Bjorken invariant xb and low 4-momentum transfer squared Q^, Drell-Yan (DY) lepton 
pair production and electroweak vector boson production at large rapidity. In these kine- 
matic regimes several corrections to leading twist perturbative QCD calculations can 
become important because of the rapid fall-off of the cross section near the kinematic 
boundary. Examples are target and jet mass corrections [5], threshold resummation [6], 
and higher-twists (HT) contributions [7, 8]. Moreover, data taken on nuclear targets must 
be corrected for nuclear effects such as shadowing, binding, Fermi motion and nucleon 
off-shellness, to access the partonic structure at the nucleon level. Accessing the highest 
values of x in DIS also requires understanding quark-hadron duality [9, 10] in order to 
utilize data in the resonance region. 

All these effects need to be incorporated in a consistent framework, simultaneously 
computed for a wide range observables, and utilized in a global PDF fit. The CTEQ6X 



global fit published in Ref. [11] took a first step in this program by considering the 
combined effect of TMC and HT corrections, alongside nuclear corrections for DIS data 
on deuterium targets needed for flavor separation of the up and down quark. In this talk, 
I discuss the results of this analysis and some recent work in progress which extends it. 
Detailed references can be found in Ref. [11]. 

TARGET MASS AND HIGHER-TWIST CORRECTIONS 

It is the usual practice in global PDF fits to place kinematic constraints on the DIS data 
sets, typically > 4 GeV^ and > 12 GeV^, so that only leading twist massless 
QCD contributions need be considered, thereby reducing the model dependent error on 
the extracted PDFs. As a byproduct of this procedure the PDFs are directly constrained 
by data only in the region x < 0.7. However, plentiful DIS data exist outside this 
region. In order to utilize them in global fits, one needs minimally to include Target 
Mass Corrections (TMCs), which scale as Mf^/Q^, with Myy the nucleon mass, and 
higher-twist corrections, which scale as with A a hadronic scale describing non 

perturbative parton-parton correlations inside the nucleon. 

Several methods are available in the literature to perform TMCs, and have been re- 
viewed by F. Steffens [5]. One is the well-known Georgi-Politzer formalism based on the 
Operator Product Expansion (OPE), whose results are also reproduced in the Covariant 
Parton Model. However, this formalism suffers from the problem that it leads to non- 
zero values of the structure function on a nucleon target in the unphysical region x > 1. 
Another prescription is a simple rescaling of the structure function, obtained by substi- 
tuting X with the Nachtmann variable =lx/{\ + 4x^M^ / 2^) • This also shares 
the above "unphysical region" problem. Lastly, working in CoUinear Factorization (CF) 
the kinematic boundaries are naturally respected. One advantage of the CF formalism 
versus the OPE formalism is that the former can be also applied to semi-inclusive DIS 
[12], and indeed to any hard-scattering process. An application to parity-violating DIS 
was discussed by T. Hobbs [13]. 

However, TMCs do not exhaust all possible 0{\/Q^) power corrections. These in- 
clude dynamical higher-twist corrections (parton correlations) as well as all uncontrolled 
leading-twist power corrections, such as Jet Mass Corrections [14]. They also include 
higher-order perturbative terms, which are logarithmic in but resemble a power law 
at low scales, and large-jc resummation effects [6]. Despite their disparate origin, it is 
customary to label these "residual" corrections as "higher-twist", as I will do here. In 
the CTEQ6X fits, these HT corrections are parameterized phenomenologically using a 
multiplicative factor modifying the structure function of the proton and the neutron: 

pdata ^ pTMC^^ ^ C{x)/Q^) , (1) 

where F™^ denotes the structure function after the target mass corrections have been 
made. The function C{x) is given by C{x) — ax^{l -\-cx). After inclusion of TMCs, 
this parameterization is sufficiently flexible to give a good description of the data. To 
simplify the global fits, the HT corrections for protons and neutrons where taken equal, 
given that their difference was found to be relatively small in other studies. 
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FIGURE 1. dependence of the Deuteron correction factor F^/{F^ + Fj" ) calculated with CTEQ6X 
PDFs, TMCs in the CF prescription, the fitted HT corrections, and the Paris deuteron wave function. 



NUCLEAR CORRECTIONS 

In order to separate the d and u quark at large x > 0.6 it is necessary to consider DIS 
data on deuteron targets, which are sensitive to a different linear combination of u and d 
quarks than the corresponding data on proton targets. However, at large x the deuteron 
deviates from a simple sum of a free proton and neutron due to significant effects of 
nuclear binding, Fermi motion and nucleon off-shellness [15]. 

Since the deuteron is weakly bound one can approximate the bound nucleon structure 
function by its on-shell value, and write the deuteron (d) structure function as 

F2'ix,Q') - L [dyf^/Ay,Y)F,'r^^'('^,QA. (2) 

N=p,n'' \y / 

Here F2N is the nucleon (proton p or neutron n) structure function including TMC and 
HT corrections. The "smearing function" /^^/^ is computed from the deuteron wave 

function and implements nuclear binding and Fermi motion corrections; at 2^ — )■ 0° it 
can be interpreted as the light-cone momentum distribution of nucleons in the deuteron. 
The variable y = {Mj/M!^){pf^ ■ l/Pd - o) is the deuteron's momentum fraction carried 
by the struck nucleon, where q is the virtual photon four-momentum, Pf^i^d) nucleon 
(deuteron) four-momentum, and M^/ the deuteron mass; it differs from the light-cone 
fractional momentum by terms of 0{M]^/Q^). Off-shell corrections to F2N can also be 
implemented in under a few assumptions, which are outside the scope of this talk. 

The deuteron correction factor / (F^" -|- F2) computed with the Paris wave function 
and the CTEQ6X PDFs is plotted in Figure 1 . It shows a remarkable 2^ dependence at 
< 20 GeV. Part of this dependence comes from the smearing function /a?//), which 
depends on the target mass through the variable 7^ = 1 +4.x;^M^/2^. However, this 
induces only minor effects on the deuteron correction factor. Most of the dependence 



2 
1.5 
1 

0.5 
2 

1.5 
1 

0.5 



J 1 M 1 M M 1 M M 1 1 
L U/U,ef 


(b): 


outO+TMC+HT 

— ciitO+nuc 

- cutO+nuc+TMC+Hl 




J 1 1 1 1 1 1 1 1 1 1 1 1 


1 1 1 1 1 L 


r Q2=10 GeV2 

=, , , , 1 , , , , 1 , . , , 1 , 





1.5 



0.5 



" ' I " ' ' I 



(a) 



cutO 
cutl 
cut2 
cut3 

I I I I 



/' 

q2=10 GeV^ (b) 



free 
dens 
nuc 
nuc+offsh 

I I I I I I I I I I 



0.2 0.4 0.6 0.8 1 0.2 0.4 0.6 0.8 1 
X X 



0.2 0.4 0.6 0.8 1 

X 



FIGURE 2. Left: PDF fits with standard DIS kinematic cuts and different combinations of TMC, HT 
and nuclear corrections. Right; sensitivity of the d PDF to kinematic cuts and nuclear corrections. Plots 
fromRef. [11]. 



shown in the figure is due to TMC and HT corrections at the nucleon level. 

It is also clear that nuclear smearing corrections do not disappear at large Q^: in 
general, they are not a subleading effect, and cannot be avoided by kinematics cuts such 
as those commonly used in global PDF fits. 



THE CTEQ6X PARTON DISTRIBUTIONS AT LARGE x 

The CTEQ6X global PDF fits [11] were performed at NLO to a wide variety of data 
similar to that used in the determination of the CTEQ6M1 PDFs except that no neutrino 
data were used since their use would involve additional nuclear corrections beyond those 
for deuterium. In addition, the E-866 dimuon data were added as were data for the CDF 
7+ jet production, the CDF W lepton asymmetry, and the D0 W asymmetry. 

Initially, a reference fit ("ref") was done using the standard W >3.5 GeV and Q> 2 
GeV (labeled cutO), with TMC, HT and nuclear corrections turned off in order to 
compare to the CTEQ6M1 PDFs. The E866 data favor at large x a slightly reduced u 
PDF and an increased d PDF; however in the latter case the W asymmetry and 7+jet 
data compensate the increase leaving the d PDF nearly unchanged. 

Subsequently, several prescriptions for TMC, HT and nuclear corrections were con- 
sidered, and the DIS kinematic cuts progressively relaxed ioW > \ .73 GeV and Q> 13 
GeV, in order to avoid most of the resonance region but be able to include a good number 
of Jefferson Lab data. This cut is labeled cut 3, with intermediate cuts labeled cutl 
and cut 2. As TMCs, the discussed OPE, (^-scaling and CF prescriptions were con- 
sidered. Nuclear smearing was performed using the Paris wave function with on-shell 
nucleon structure functions or off- shell corrections from the MST model [16]; the results 
were compared to fits obtained using either no nuclear corrections apart from isospin ef- 
fects, or nuclear corrections in the Density Model, which extrapolates the nuclear effects 
observed in heavier nuclei to the deuteron. 
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FIGURE 3. Sensitivity of the d PDF (left) and fitted HT term (right) to vaiious TMC prescriptions.Plots 
taken from Ref . [11]. 



The main results of the CTEQ6X analysis can be summarized as follows. 

• Standard kinematic cuts. When using the standard DIS kinematic cuts, W > 3.5 
GeV and Q>2 GeV, the PDFs are insensitive to TMC and HT corrections; however 
nuclear corrections are large and start at jc > 0.45, in a region well inside what is 
included in the cuts (Fig. 2, left). 

• Enlarged kinematic cuts. The PDFs are relatively stable against variations of the 
DIS cuts in the vicinity of the > 1 .73 GeV and Q > 1 .3 cut (Fig. 2, right plot, left 
panel). As a consequence of the enlarged data set, there is a substantial reduction 
in the uncertainty of these PDFs due to the increased data, with the cut 3 errors 
reduced by 10-20% for < 0.6, and by up to 40-60% at larger x. 

• Stability with respect to TMCs. The PDFs are nearly independent of the TMC 
prescription (Fig. 3, left); this is very important for fitting leading-twist PDFs. 
Changes in TMCs are absorbed by the phenomenological HT term, for which TMC 
modeling induces a non-negligible systematic uncertainty (Fig. 3, right). 

• Large sensitivity to nuclear corrections. The d PDF is very sensitive to the 
nuclear correction model adopted (Fig. 2, right plot, right panel). This induces a 
large systematic uncertainty, further discussed in the next section. 



NUCLEAR UNCERTAINTIES AND COLLIDER PHYSICS 

A detailed investigation of the systematic PDF uncertainties induced by nuclear correc- 
tions modeling is underway, as reviewed by J. Owens [1]. One very important result is 
that, surprisingly, the large-x gluon PDF is as sensitive to nuclear corrections as the d 
PDF; they are anticorrelated to each other due to an interplay of DIS, DY and jet data. 

This has potentially profound implications for future collider experiments, since the 
resulting variation of the gluon is significant even at values of x as low as 0.4. As an 




illustration, we can consider the parton luminosities, 
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where s (s) is the hadronic (partonic) center of mass energy squared, and // is the PDF for 
a parton of flavor i at = s. As an example, the gg luminosity controls the total main 
channel for Higgs production, the gd luminosity controls the "standard candle" cross 
section for W production, and the du luminosity is relevant to jet production. These 
are plotted in Fig. 4 for y/s = 1 TeV, relevant to the current LHC runs. The nuclear 
uncertainty grows quickly above 5-10% as exceeds 1 TeV. 



AN APPLICATION TO QUARK-HADRON DUALITY 

Quark-hadron duality in structure functions refers to the experimental observation that 
inclusive structure functions in the region dominated by low-lying nucleon resonances 
follow deep inelastic structure functions describing high energy data, to which the 
resonance structure functions average [9]. The new large-x CTEQ6X PDFs can be used 
to verify to what degree this holds true, which is important to understand the transition 
from the perturbative (partonic) to the non perturbative (hadronic) regime of QCD. 

The handbag diagram used in the pQCD computations assumes no interaction be- 
tween the scattered quark and the target remnant. This is a reasonable approximation 
only if their rapidity separation, Aj, is large enough. A value of Aj > 2 — 4, known as 
"Berger criterion" [18], should be sufficient to ensure applicability of pQCD. Since Aj 
decreases as .x; — )■ 1 or 2^ — )• 0, this limits the range in x and where the comparison 
of pQCD computations and resonance region data makes sense. 

In Figure 5 the ratios of Jefferson Lab data averaged over different resonance regions 
to computations using CTEQ6X are plotted [10, 17]. These are compared to calculations 
using Alekhin09 PDFs [19], which were fitted with similar TMC, HT and nuclear 
corrections as CTEQ6X. Only data satisfying a conservative Ay > 4 are retained. Data 
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FIGURE 5. Preliminary results of a comparison of averaged JLab F2 data over 4 "resonance regions" 
and computations using CTEQ6X (red) and Alekhin09 PDFs (blue). 



in the = (3.1,3.9) GeV^ region were included in the CTEQ6X fit, and indeed are 
well described. The = (2.5,3.1) GeV^ region, while not directly included in the 
fit, is constrained by DIS data at larger because of DGLAP evolution. Data in the 
— (1.9, 2.5) GeV^ region lie in the extrapolation region of the CTEQ6X fits. 
The plot shows that quark-hadron duality works within 5-10%. This opens the pos- 
sibility of using resonance region data to extend the range of validity in x of the fits. A 
study is underway [17] to further explore this issue, e.g., by determining whether duality 
holds for smaller values of the Ay cut, which would enlarge the number of data points, by 
evaluating nuclear systematic uncertainties and by quantifying PDF uncertainties from 
the HT parameters, especially in the extrapolation region. 



CONCLUSIONS 

I have shown that a good control of global PDF fits can be achieved when including DIS 
data in the pre- asymptotic region of large x and small Q^, if one considers TMC and HT 
corrections. The resulting PDFs are stable against available TMC prescriptions, with the 
modeling uncertainty absorbed in the phenomenologically extracted HT terms. This is 
very good for applications to collider and neutrino physics [20], and for comparing PDF 
moments to lattice calculations [21, 22]. 

Theoretical nuclear corrections to deuteron target data are necessary for u and d quark 
separation atx> 0.5. The J-quark and, surprisingly, the gluon PDF turn out to be very 



sensitive to uncertainties in the modeling and calculation of nuclear effects. Furthermore, 
the induced uncertainty on parton luminosities at the LHC is non-negligible. 

A careful study shows that the induced systematic rf-quark PDF uncertainty is of the 
same order as the experimental uncertainty if the deuteron target data are removed from 
the fit [1]. Therefore, further progress in constraining the d quark and the gluon PDFs 
at large x requires a better theoretical understanding of nuclear corrections, combined 
with new data on free proton, but sensitive to d, such as from parity violating DIS or 
from neutrino DIS on a hydrogen target [20]. Alternatively one can use data minimiz- 
ing nuclear corrections, such as from proton-tagged DIS on deuteron targets, for which 
the nuclear uncertainty is smaller than 2% [23]. Using quark-hadron duality to include 
resonance region data in the fits, thereby extending their x range, seems also feasible. Fi- 
nally, data sensitive to large x gluons, such as from the longitudinal and charm structure 
functions, Fl and are required to constrain the gluons independently of the jet data. 
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